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SUMMARY 

Generalized  mean  squared  error  is  a flexible  measure  of  the  adequacy 
of  a regression  estimator.  It  allows  specific  characteristics  of  the 
regression  model  and  its  intended  use  to  be  incorporated  in  the  measure 
itself.  Similarly,  integrated  mean  squared  error  enables  a researcher  to 
stipulate  particular  regions  of  interest  and  weighting  functions  in  the 
assessment  of  a prediction  equation.  The  appeal  of  both  measures  is  their 
ability  to  allow  design  or  model  characteristics  to  directly  influence  the 
evaluation  of  fitted  regression  models.  In  this  note  an  equivalence  of 
the  two  measures  is  established  for  correctly  specified  models. 
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G = E[(6  - 6) 'M(6  - 6)],  (4) 

for  nonnegative  definite  matrices  M.  The  addition  of  the  matrix  M in  the 
definition  of  generalized  mean  squared  error  allows  a researcher  the 
flexibility  of  stressing  specific  linear  combinations  or  individual  elements 
of  6 more  heavily  than  others  in  the  computation  of  G. 

Theobald  (1974)  showed  that  a range  of  values  of  k,  the  ridge  param- 
eter, exists  which  guarantees  that  the  ridge  estimator  has  smaller  general- 
ized mean  squared  error  than  least  squares  for  all  choices  of  M.  He  also 
found  sufficient  conditions  for  the  ridge  estimator  to  have  smaller  general- 
! ized  mean  squared  error  than  least  squares  for  all  M.  The  implication  of 

' this  result  is  that  if  the  ridge  parameter  satisfies  the  sufficient  con- 

ditions, all  linear  combinations  of  the  regression  coefficients  can  be 
estimated  with  smaller  mean  squared  error  using  ridge  regression  than  least 
squares . 

Extending  Theobald's  results,  Gunst  and  Mason  (1976)  derived  sufficient 
conditions  for  the  principal  component  estimator  to  have  smaller  generalized 
; mean  squared  error  than  least  squares  for  all  M.  There  is  not  an  existence 

i theorem  similar  to  that  for  ridge  regression  which  guarantees  that  a prin- 

cipal component  estimator  can  always  be  found  which  has  smaller  generalized 

i 

I mean  squared  error  than  least  squares  for  all  choices  of  M.  Gunst  and  Mason 

I 

(1976)  also  show  that  neither  principal  components  nor  ridge  regression 
dominates  the  other  in  generalized  mean  squared  error  for  all  choices  of  M. 
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Box  and  Draper  (1959)  stimulated  a series  of  papers  by  several  authors 
on  the  use  of  integrated  mean  squared  error  as  a criterion  for  evaluating 
various  designs  for  fitting  response  surface  models.  Integrated  mean  squared 
error  is  defined  as 

A M 

J = /•••/  E{ (Y(u)-E[y (u)] ) } W(u)du.  (5) 

As  defined  in  (5) , integrated  mean  squared  error  incorporates  the  mean 
squared  error  of  the  prediction  equation  at  a point  u e R,  weighted  by  a 
function  W{u),  and  integrated  over  a region  R of  interest  to  the  researcher. 
Since  the  researcher  can  define  a weight  function  and  region  of  interest  to 
suit  the  purpose  of  his  investigations,  integrated  mean  squared  error  offers 
the  type  of  generality  and  flexibility  in  the  evaluation  of  prediction 
equations  as  generalized  mean  squared  error  does  for  regression  estimators. 

Gunst  and  Mason  (1979)  adopted  integrated  mean  squared  error  as  a 
criterion  for  the  assessment  of  prediction  equations  for  which  the  matrix 
of  predictor  variables  is  fixed  but  one  can  choose  which  regression  estimator 
to  use.  Specifically,  integrated  mean  squared  error  was  employed  to  evaluate 
competing  prediction  equations  based  on  least  squares,  principal  component, 
and  ridge  regression  estimators  when  the  matrix  of  predictor  variables  is 
multicollinear . We  now  wish  to  identify  conditions  for  which  the  two  model- 
fitting criteria  are  equivalent. 

2.  AN  EQUIVALENCE  RELATIONSHIP 

Helms  (1971)  shows  that  integrated  mean  squared  error,  J,  is  affected 
by  the  weight  function  and  the  region  of  interest  only  through  the  second 


order  moment  matrix 
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t “ /• • •/  uu'W (u)du 
R 


(6) 


We  now  propose  the  following  theorem. 

Theorem;  If  a multiple  linear  regression  model  is  correctly  specified 
as  in  (1), 

J = n”^o^  + G 

when  M = j;. 

Proof:  J = /•••/  E{(Y(u)-E[Y(u)l)^}  W(u)du 

= £{/•••/  (Y  - W(u)  du} 

R 0 - - 

+ 2E{/”*/  (Y  - 6Q)(u'i  - u'6)  W(u)du} 

+ e{/***/  (u'8  - u'e)^W(u)du) 

R ~ " 

= n'^o^  + E{ (6  - 0)'t(g  - 6)}. 


An  important  consequence  of  this  theorem  is  that  the  existence  and 
sufficient  conditions  established  by  Theobald  (1974)  for  the  ridge  estimator 
to  have  smaller  generalized  mean  squared  error  than  least  squares  for  all 
choices  of  M now  guarantee  that  under  those  same  conditions  the  integrated 
mean  squared  error  of  the  ridge  prediction  equation  is  less  than  that  for 
least  squares  with  all  choices  of  W(u)  and  R that  produce  nonnegative 
definite  second  order  moment  matrices.  Similar  equivalence  relationships 
hold  for  the  generalized  mean  squared  error  properties  established  in  Gunst 
and  Mason  (1976). 
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